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Abstract 

We describe a method for analysing 
the temporal structure of a discourse 
which takes into account the effects 
of tense, aspect, temporal adverbials 
and rhetorical structure and which 
minimises unnecessary ambiguity in 
the temporal structure. It is part 
of a discourse grammar implemented 
in Carpenter's ale formalism. The 
method for building up the tempo- 
ral structure of the discourse com- 
bines constraints and preferences: we 
use constraints to reduce the number 
of possible structures, exploiting the 
HPSG type hierarchy and unification 
for this purpose; and we apply prefer- 
ences to choose between the remain- 
ing options using a temporal center- 
ing mechanism. We end by recom- 
mending that an underspecified repre- 
sentation of the structure using these 
techniques be used to avoid generat- 
ing the temporal/rhetorical structure 
until higher-level information can be 
used to disambiguate. 



1 Introduction 

In this paper we describe a method for 
analysing the temporal structure of a discourse. 
This component was implemented as part of a 
discourse grammar for English. The goals of 



the temporal component were to yield a de- 
tailed representation of the temporal structure 
of the discourse, taking into account the effect 
of tense, aspect and temporal expressions while 
at the same time minimising unnecessary am- 
biguity in the temporal structure. The method 
combines a constraint-based approach with an 
approach based on preferences: we exploit the 
HPSG type hierarchy and unification to arrive at 
a temporal structure using constraints placed 
on that structure by tense, aspect, rhetorical 
structure and temporal expressions, and we use 
the temporal centering pref e rences descri bed 
by ( Kameyama et al., 1993 ; Poesio, 1994 ) to 
rate the possibilities for temporal structure and 
choose the best among them. 

The starting point for this work was Scha 
and Polanyi's discourse grammar (Scha & 
Polanyi 1988; Prust et al 1994). For the 
implementation we extended the HPSG gram- 
mar (Pollard and Sag, 1994) which Gerald 
Penn and Bob Carpenter first encoded in 
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ale (Carpenter, 1993). This paper will focus 
on our temporal processing algorithm, and in 
particular on our analysis of narrative progres- 
sion, rhetorical structure, perfects and tempo- 
ral expressions. 

2 Constraints on narrative 
continuations 

Probably the best known algorithm for track- 
ing narrative progression is that developed 
by Kamp fll979| ), Hinrichs fll98l| ), and Par- 
tee ( [1984 ), which formalises the observation 
that an event will occur just after a preceding 
event, while a state will overlap with a pre- 
ceding event. This algorithm gives the correct 
results in examples such as the following: 

(1) John entered the room. Mary stood up. 



(2) John entered the room. Mary was 
seated behind the desk. 

In (|l|) the event of Mary's standing is under- 
stood to occur just after John enters the room, 
while in (^) the state in which Mary is seated is 
understood to overlap with the event of John's 
entering the room. 

However, if there is a rhetorical relationship 
between two eventualities such as causation, 
elaboration or enablement, the temporal de- 
faults can be overridden, as in the following 
examples: 

(3) a. John fell. Mary pushed him. 

b. Local builders constructed the Ford 
St. Bridge. They used 3 tons of 
bricks. 

In (|3|a) there is a causal relationship between 
Mary's pushing John and his falling, and the 
second event is understood to precede the first. 
In (|]b), the second sentence is an elaboration 
of the first, and they therefore refer to aspects 
of the same event rather than to two sequential 
events. 

It has been suggested that only world knowl- 
edge allows one to detect that the default is 
being overridden here. For example, Lascarides 
& Asher (1991) suggest that general knowledge 
postulates (in the case of (||a): that a pushing 
can cause a falling) can be invoked to generate 
the backward movement reading. 

The problem for practical systems is twofold: 
we could assume that in the case of narrative 
the Kamp/Hinrichs/Partee algorithm is the de- 
fault, but each time the default is applied we 
would need to check all our available world 
knowledge to see whether there isn't a world 
knowledge postulate which might be overriding 
this assumption. Clearly this would make the 
processing of text a very expensive operation. 

An alternative is to assume that the tempo- 
ral ordering between events in two consecutive 
sentences can be any of the four possibilities 
(Just-after, precede, same-event and overlap). 
But then the resulting temporal structures will 
be highly ambiguous even in small discourses. 
And sometimes this ambiguity is unwarranted. 
Consider: 

(4) Mary stared at John. He gave her back 
her slice of pizza. 



Here, it would appear, only one reading is pos- 
sible, i.e. the one where John gave Mary her 
slice of pizza just after she stared or started to 
stare at him. It would be undesirable for the 
temporal processing mechanism to postulate an 
ambiguity in this case. 

Of course, sometimes it is possible to take 
advantage of certain cue words which either in- 
dicate or constrain the rhetorical relation. For 
example, in (|^) the order of the events is un- 
derstood to be the reverse of that in (Q) due 
to the cue word because which signals a causal 
relationship between the events: 

(5) John entered the room because Mary 
stood up. 



As Kehler ( 1994 ) points out, if forward move- 
ment of time is considered a default with con- 
secutive event sentences, then the use of "be- 
cause" in (||) should cause a temporal clash — 
whereas it is perfectly felicitous. Temporal 
expressions such as at noon and the previous 
Thursday can have a similar effect: they too 
can override the default temporal relations and 
place constraints on tense. In (^), for exam- 
ple, the default interpretation would be that 
John's being in Detroit overlaps with his being 
in Boston, but the phrase the previous Thurs- 
day overrides this, giving the interpretation 
that John's being in Detroit precedes his be- 
ing in Boston: 

(6) John was in Boston. The previous 
Thursday he was in Detroit. 

This suggests that the temporal information 
given by tense acts as a weaker constraint on 
temporal structure than the information given 
by temporal adverbials. 

The possibilities for rhetorical relations (e.g., 
whether something is narration, or elaboration, 
or a causal relation) can be further constrained 
by aspect. For example, a state can elaborate 
another state or an event: 

(7) a. Mary was tired. She was exhausted, 
b. Mary built a dog house. It was a 

labour of love. 
But an event can only elaborate another event, 
as in (H) : 

(8) a. Mary built a dog house. She used two 

tons of bricks, 
b. Mary was tired/working hard. ?She 
built a dog house. 



Table 1. Possible relations when S2 expresses a simple past event. 



Si 


Relation 


Example 


past event 


just-after Si 
precede Si 
overlap Si 
same-event Si 


Mary pushed John. He fell. 
John fell. Mary pushed him. 

NO 

I assembled the desk myself. The drawers only took me ten 
minutes. 


past activity 


just-after Si 
precede Si 
overlap Si 
same-event Si 


Mary stared at John. He gave her back her slice of pizza. 

NO 
NO 
NO 


past state 


just-after Si 
just-after TFi 
precede Si 
precede TFi 
overlap Si 
overlap TFi 
same-event Si 
same-event TFi 


NO 

Sam arrived at eight. He was tired. He rang the bell. 

NO 

?John fell. He was in pain. Mary pushed him. 
Mary was angry. She pushed John. 

NO 
NO 

I assembled the desk myself. It was beautiful. The drawers only 
took me ten minutes. 


past perf event 


just-after Si 
precede Si 
overlap Si 
same-event Si 


Sam had arrived at the house. He rang the bell. 

Sam arrived at the house. He had lost the key. He rang the bell. 

NO 

I had assembled the desk myself. The drawers only took me ten 
minutes. 


past perf activity 


just-after Si 
precede Si 
overlap Si 
same-event Si 


Mary had stared at John. He gave her back her slice of pizza. 

NO 
NO 
NO 


past perf state 


just-after Si 
just-after TFi 

precede Si 
precede TFi 
overlap Si 

overlap TFi 
same-event Si 
same-event TFi 


NO 

Martha discovered the broken lock. Someone had been in the 
garage. They rearranged the tools. 

NO 
NO 

Martha discovered the broken lock. Someone had been in the 
garage. They rearranged the tools. 

NO 
NO 

Mary built the desk herself. She had been happy taking it on. 
The drawers only took her ten minutes. 



For the eventive second sentence of (|8|b) to be 
an elaboration of the first sentence, it must oc- 
cur in a stative form — for example as a progres- 
sive (i.e., She was building a dog house). 

Because of considerations like these, our aim 
in the implementation work was to treat tense, 
aspect, cue words and rhetorical relations as 
mutually constraining, with more specific infor- 
mation such as explicit cue words having higher 
priority than less specific information such as 
tense. The main advantage of this approach 
is that it reduces temporal structure ambiguity 
without having to rely on detailed world knowl- 
edge postulates. 

Table 1 lists the possible temporal relations 
between the eventualities described by two con- 



secutive sentences without temporal expres- 
sions or cue words, where the first sentence 
(Si) may have any tense and aspect and the 
second sentence (S2) expresses a simple past 
event. We constrain S2 in this way because of 
lack of space; additional constraints are given 
in (Hitzcman et ah, 1994). For example, if 
a simple past eventive sentence follows a sim- 
ple past eventive sentence the second event can 
be understood to occur just after the first, to 
precede the first or to refer to the same event 
as the first (an elaboration relation), but the 
two events cannot overlap; these constraints 
are weaker, however, than explicit clues such 
as cue words to rhetorical relations and tempo- 
ral expressions. When Si expresses a state, it 



is possible for the temporal relation to hold be- 
tween the event described by S2 and the event 
or activity most closely preceding Si, i.e., the 
temporal focus of Si, here referred to as TFi.f] 



However, we haven't solved the problem 



For the problem with multi-sentence dis- 
courses, and the "threads" that sentences con- 
tinue, we use an implementation of tempo- 
ral centering (Kameyama et al., 1993; PoesioJ 
1994). This is a technique similar to the type of 



completely at this point: although tense can centering used for nominal anaphora ( |5idncr 



provide a further constraint on the temporal 1983; Grosz et al., 1983). Centering assumes 



structure of such discourses, it can also add a 
further ambiguity. Consider (Q): 

(9) Sam rang the bell. He had lost the key. 

Clearly, the event described by the past perfect 
sentence must precede the event described by 
the first, simple past sentence. However, if a 
third sentence is added, an ambiguity results. 
Consider the following possible continuations 
of @ : 

(10) a. ...Hannah opened the door. 

b. ...It fell through a hole in his pocket. 

The temporal relation between these continu- 
ations and the portion of earlier text they at- 
tach to is constrained along the lines sketched 
before. The problem here is determining which 
thread in (||) they continue; (|l(]a) continues the 
thread in which Sam rings the bell, but (JTo|b) 
continues the thread in which Sam loses the 
key. 

A further ambiguity is that when the third 
sentence is past perfect, it may be a continua- 
tion of a preceding thread or the start of a new 
thread itself. Consider: 

(11) a. Sam rang the bell. He had lost the 

key. It had fallen through a hole in 
his pocket. 

b. John got to work late. He had left 
the house at 8. He had eaten a big 
breakfast. 

In (a) the third sentence continues the thread 
about losing the key; in (b) the third starts a 
new thread. n 



1 In this chart it appears that whether the tense 
is simple past or past perfect makes no difference, 
and that only aspect affects the possible tempo- 
ral relations between Si and S2. However, it is 
important not to ignore tense because other com- 
binations of tense and aspect do show that tense 
affects which relations are possible, e.g., a simple 
past stative S2 cannot have a precede relation with 
any Si, while a past perfect stative S2 can. 

2 We will not discuss the additional problem that 
if the final sentence in (nib) is the end of the text, 
the text is probably ill-formed. This is because 



that discourse understanding requires some no- 
tion of "aboutness." While nominal centering 
assumes there is one object that the current dis- 
course is "about," temporal centering assumes 
that there is one thread that the discourse is 
currently following, and that, in addition to 
tense and aspect constraints, there is a prefer- 
ence for a new utterance to continue a thread 
which has a parallel tense or which is seman- 
tically related to it and a preference to con- 
tinue the current thread rather than switching 
to another thread. Kameyama et al. (1993) 
confirmed these preferences when testing their 
ideas on the Brown corpus. 

As an example of how the temporal centering 
preference techniques can reduce ambiguity, re- 
call example ([)]) and the possible continuations 
shown in (|l^) . The difficulty in these examples 
is determining whether the third sentence con- 
tinues the thread begun by the first or second 
sentence. For example, in (JToja) the preference 
technique which allows us to choose the first 
thread over the second is one which assigns a 
higher rating to a thread whose tense is parallel 
to that of the new sentence; in this case both 
Sam rang the bell and Hannah opened the door 
are in the simple past tense. In example (flo|b) 
the fact that the key is mentioned only in the 
second sentence of ^ links (|Io|b) with the sec- 
ond thread. To handle an example like (|lj), we 
employ a preference for relating a sentence to 
a thread that has content words that are rated 
as semantically "close" to that of the sentence: 

(12) Sam rang the bell. He had lost the key. 
His keyring broke. 

We store semantic patterns between words 
as a cheap and quick form of world knowl- 
edge; these patterns are easier to provide than 
are the detailed world knowledge postulates re- 
quired in some other approaches, and result in 
similar and sometimes more precise temporal 



a well-formed text should not leave threads "dan- 
gling" or unfinished. This is probably also the rea- 
son for the awkwardness of the well-known exam- 
ple Max poured a cup of coffee. He had entered the 
room. 



rhet_reln 



just_before overlaps same_event precedes no_temp_reln 

sequences causes background elaboration results reverse_sequence contrast list enumeration 

Figure 1: The type hierarchy used for constraints 



structures with less processing overhead. Using 
the semantic patterns we know that key and 
keyring are semantically close, and through 
that semantic link between the second and 
third sentences we prefer to connect the third 
sentence to the thread begun by the second.^ 
The approach to representing semantic rela- 
tionshi ps w e take is one used by Morris & 
Hirst ( |l99l| ) wherein the words in the lexicon 
are associated with each other in a thesaurus- 
like fashion and given a rating according to how 
semantically "close" they are. We thus avoid 
relying on high-level inferences and very spe- 
cific world knowledge postulates, our goal being 
to determine the temporal structure as much as 
possible prior to the application of higher-level 
inferences. 

3 An HPSG implementation of a 
discourse grammar 

Following Scha & Polanyi ( |1988[ ) and Priist 
et al (1994), our model of discourse consists 
of units called Discourse Constituent Units 
(DCUs) which are related by various temporal 
and rhetorical relations. A basic DCU repre- 
sents a sentence (or clause) , and complex DCUs 
are built up from basic and complex DCUs. 

In our ale implementation, a DCU contains 
the following slots for temporal information: 

CUE_word: Cues to rhetorical structure, e.g., 
"because." 

V_AND_NP_LIST: Contains content words found 
in this DCU, and is used to compare the 
content words of the current DCU with 
those in previous threads, in order to rate 
the semantic "closeness" of the dcu to 
each thread. 

3 Semantic closeness ratings won't help in exam- 
ples (0) - ( |lo| ) because there is as strong a relation- 
ship between door and bell as there is between door 
and key. 



SEM_ASPECT: Contains the semantic aspect 
{event, state, activity). We have extended 
the Penn & Carpenter implementation of 
the HPSG grammar so that semantic aspect 
is calculated compositionally (and stored 
here) . 

RHET_RELN: The relation between this DCU 
and a previous one. Lexical items and 
phrases such as cue words (stored in 
CUE_word) affect the value of this slot. 

temp_CENTER: Used for temporal centering; 
Keeps track of the thread currently be- 
ing followed (since there is a preference for 
continuing the current thread) and all the 
threads that have been constructed so far 
in the discourse. 

fwd .center: Existing threads 
BKWD_CENTER: The thread currently be- 
ing followed 
CLOSED_threads: Threads no longer 
available for continuation 

temp_expr_RELNS: Stores the semantic inter- 
pretation of temporal expressions associ- 
ated with this DCU. 

temp_relns: Stores the temporal relations 
between the eventualities in the discourse. 

TEMPFOC: The most recent event in the cur- 
rent thread which a subsequent eventuality 
may elaborate upon {same-event), overlap, 
come just_after or precede. 

tenasp: Keeps track of the tense and syntactic 
aspect of the DCU (if the DCU is simple). 

tense: past, pres, fut 

aspect: simple, perf, prog, perLprog 

To allow the above-mentioned types of infor- 
mation to mutually constrain each other, we 
employ a hierarchy of rhetorical and tempo- 
ral relations (illustrated in Figure 1), using the 



ale system in such a way that clues such as 
tense and cue words work together to reduce 
the number of possible temporal structures. 
This approach improves upon earlier work on 



discourse structure such as (Lascarides and 



Asher, 1991) and (Kehler, 1994) in reducing 



the number of possible ambiguities; it is also 
more precise than the Kamp/Hinrichs/Partee 
approach in that it takes into account ways 
in which the apparent defaults can be overrid- 
den and differentiates between events and ac- 
tivities, which behave differently in narrative 
progression. 

Tense, aspect, rhetorical relations and tem- 
poral expressions affect the value of the 
rhet_reln type that expresses the relation- 
ship between two DCUs: cue words are lexically 
marked according to what rhetorical relation 
they specify, and this relation is passed on to 
the DCU. Explicit relation markers such as cue 
words and temporal relations must be consis- 
tent and take priority over indicators such as 
tense and aspect. For example, sentence ( ^3| ) 
will be ruled out because the cue phrase as a re- 
sult conflicts with the temporal expression ten 
minutes earlier. 

(13) #Mary pushed John and as a result ten 
minutes earlier he fell. 

On the other hand, if temporal expressions in- 
dicate an overlap relation and cue words indi- 
cate a background relation as in (O), these con- 
tributions are consistent and the R,het_reln 
type will contain a background value (the more 
specific value of the two): 

(14) Superman stopped the train just in 
time. Meanwhile, Jimmy Olsen was in 
trouble. 

4 The algorithm 

For reasons of space it is difficult to give ex- 
amples of the sign-based output of the gram- 
mar, or of the ale rules, so we will restrict 
ourselves here to a summary of the algorithm 
and to a very limited rendition of the system 
output. The algorithm used for calculating the 
temporal structure of a discourse can be sum- 
marised as follows. It consists of two parts, the 
constraint-based portion and the preference- 
based portion: 

1. The possible temporal/rhetorical relations 
are constrained. 



(a) If there is a temporal expression, it 
determines the temporal relationship 
of the new DCU to the previous ones, 
and defaults are ignored. 

(b) Lexical items such as cue words influ- 
ence the value of the rhet_reln type 
(See Figure 1). 

(c) If steps (a) and (b) attempt to place 
conflicting values in the rhet_RELN 
slot, the parse will fail. 

(d) If there is no temporal expression or 
cue phrase, tense and semantic as- 
pect also influence the value of the 
RHET_RELN type (See Table 1), so 
that rhetorical relations, tense and as- 
pect constrain each other. 

2. If more than one possibility exists, seman- 
tic preferences are used to choose between 
the possibilities. 

(a) A "semantic distance" rating between 
the new DCU and each previous thread 
is determined. (If there are no exist- 
ing threads a new thread is started.) 

(b) Other preferences, such as a prefer- 
ence for relating the new DCU to a 
thread with parallel tense, arc em- 
ployed (See ( Kameyama et al., 1995 ; 



Poesio, 1994| ) for details), and the re- 



sulting ratings are factored into the 
rating for each thread. 

(c) If the thread currently being followed 
is among the highest rated threads, 
this thread is continued. (This corre- 
sponds to temporal centering's prefer- 
ence to continue the current thread.) 

(d) If not, the DCU may continue any of 
the highest rated threads, and each of 
these solutions is generated. 

Charts such as Table 1 provide the observa- 
tions we use to fill in the value of rhet_reln. 
Those observations are summarised below. In 
what follows, the event variable associated with 
DCUj is e^ and the tempfoc of e! is the most 
recent event /activity processed, possibly ei it- 
self: 

• e2 can overlap with ei if 

— DCU2 describes a state, or 

— DCUi describes a state and DCU2 de- 
scribes an activity. 



• e2 can occur just-after the tempfoc of ei 
if 

— DCU2 describes a simple tense event, 
or 

— DCUi describes a complex tense clause 
and DCU2 describes a complex tense 
event, or 

— DCUi describes an event and DCU2 
describes an atelic or a simple tense 
state, or 

— DCUi describes a state and DCU2 de- 
scribes a simple tense activity. 

• e2 can precede ei if 

— DCU2 describes an event, or 

— DCUi doesn't describe an activity and 
DCU2 describes a past perfect stative. 

• e2 can elaborate on ei if 

— DCUi describes an event, or 

— DCUi describes an activity and DCU2 
describes an atelic, or 

— DCUi and DCU2 describe states and 
either DCU2 describes a simple tense 
state or DCUi describes a complex 
tense state. 

Using this algorithm, we can precisely iden- 
tify the rhetorical and temporal relations when 
cue words to rhetorical structure are present, 
as in (p"5|): 

(15) John fell (ei) because Mary pushed him 
(e 2 ). 

temp_relns: e2 precedes ei 

We can also narrow the possibilities when no 
cue word is present by using constraints based 
on observations of tense and aspect interactions 
such as those shown in Table 1. For example, if 
DCUi represents a simple past eventive sentence 
and DCU2 a past perfect eventive sentence, then 
in spite of the lack of rhetorical cues we know 
that e2 precedes ei, as in jl^): 

(16) Sam rang the doorbell (ei). He had lost 
the key (ea). 

temp_relns: e2 precedes ei 

Also, when several structures are possible we 
can narrow the possibilities by using prefer- 
ences, as in the examples below: 



(17) Sam arrived at the house at eight (ei). 
He had lost the key (e2). 

a. ...He rang the bell (63). 
temp_relns: e2 precedes ei, 

e3 just-after ei 

b. ...It fell through a hole in his pocket 

temp_relns: e 2 precedes ei, 
63/ just-after e2 

If we allow any of the four possible temporal 
relations between events, both continuations of 
sentence ( |l7j ) would have 17 readings (4x4 + 
1 reading in which the third sentence begins a 
new thread). Using constraints, we reduce the 
number of readings to 4. Using preferences, 
we reduce that to 2 readings for each continua- 
tion. The correct temporal relations are shown 
in ©.0 

5 An underspecified 
represent at ion 

By using constraints and preferences, we can 
considerably reduce the amount of ambiguity 
in the temporal/rhetorical structure of a dis- 
course. However, explicit cues to rhetorical and 
temporal relations are not always available, and 
these cases result in more ambiguity than is de- 
sirable when processing large discourses. 

Consider, however, that instead of generat- 
ing all the possible temporal/rhetorical struc- 
tures, we could use the information available to 
fill in the most restrictive type possible in the 
type hierarchy of temporal/rhetorical relations 
shown in Figure 1. We can then avoid generat- 
ing the structures until higher- level information 
can be applied to complete the disambiguation 
process. 

6 Conclusion 

We presented a brief description of an algo- 
rithm for determining the temporal structure 
of discourse. The algorithm is part of an HPSG- 
style discourse grammar implemented in Car- 
penter's ale formalism. Its novel features are 
that it treats tense, aspect, temporal adverbials 
and rhetorical relations as mutually constrain- 
ing; it postulates less ambiguity than current 

4 The other reading, in which the third sentence 
is an elaboration of one of the preceding events, 
must not be ruled out because there are cases such 
as Sam arrived at the house at eight. He rang the 
bell. He let it ring for two minutes, in which such 
elaboration is possible. 



temporal structuring algorithms do; and it uses 
semantic closeness and other preference tech- 
niques rather than full-fledged world knowl- 
edge postulates to determine preferences over 
remaining ambiguities. We also recommended 
using an underspecified representation of tem- 
poral/rhetorical structure to avoid generating 
all solutions until higher-level knowledge can 
aid in reducing ambiguity. 
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